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Abstract — Social scientists have observed that human behavior 
in society can often be modeled as corresponding to a threshold 
type policy. A new behavior would propagate by a procedure 
in which an individual adopts the new behavior if the fraction 
of his neighbors or friends having adopted the new behavior 
exceeds some threshold. In this paper we study the question of 
whether the emergence of threshold policies may be modeled 
as a result of some rational process which would describe the 
behavior of non-cooperative rational members of some social 
network. We focus on situations in which individuals take the 
decision whether to access or not some content, based on the 
number of views that the content has. Our analysis aims at 
understanding not only the behavior of individuals, but also 
the way in which information about the quality of a given 
content can be deduced from view counts when only part of the 
viewers that access the content are informed about its quality. 
In this paper we present a game formulation for the behavior of 
individuals using a meanfield model: the number of individuals 
is approximated by a continuum of atomless players and for 
which the Wardrop equilibrium is the solution concept. We derive 
conditions on the problem's parameters that result indeed in the 
emergence of threshold equilibria policies. But we also identify 
some parameters in which other structures are obtained for the 
equilibrium behavior of individuals. 

Index Terms — User-generated content, Complex Systems, 
Video popularity, Game theory, Wardrop equilibria 

I. Introduction 

Online media constitute currently the largest share of Inter- 
net traffic. A large part of such traffic is generated by platforms 
that deliver user-generated content (UGC). This includes, 
among the other ones, YouTube and Vimeo for videos, Flickr 
and Instagram for images and all social networking platforms. 

Among such services, a prominent role is played by 
YouTube. Founded in 2005 by Chad Hurley, Steve Chen and 
Jawed Karim and acquired in 2006 by Google, YouTube scored 
in 2011 more than 1 trillion views (or, alternatively, an average 
of 140 video views for every person on Earth), with more than 
3 billion hours of video watched every month and 72 hours 
of video uploaded every minute by YouTube's userfl 

Of course, not all videos posted on YouTube are equal. 
The key aspect is their "popularity", broadly defined as the 
number of views they score (also referred to as viewcount). 
This is relevant from a twofold perspective. On the one hand, 
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more popular content generates more traffic, so understanding 
popularity has a direct impact on caching and replication 
strategy that the provider should adopt. On the other one, 
popularity has a direct economic impact. Indeed, popularity 
or viewcount are often directly related to click-through rates 
of linked advertisements, which constitute the basis of the 
YouTube's business model. 

Recently, a number of researchers have analysed the evo- 
lution of the popularity of online media content (TJ, Q, 
E), 10, 0, with the aim of developing models for early-stage 
prediction of future popularity Q. 

Such studies have highlighted a number of phenomena that 
are typical of UGC delivery. This includes the fact that a 
significant share of content gets basically no views J6), as 
well as the fact that popularity may see some bursts, when 
content "goes viral" |4j- Also, in Q the authors demonstrate 
that after an initial phase, in which contents gain popularity 
through advertisement and other marketing tools, the platform 
mechanisms to induce users to access contents (re-ranking 
mechanisms) are main drivers of popularity. 

In this paper, we address such phenomena, by developing 
a model, based on game theoretical concepts and tools, for 
understanding how user's behaviour drives the evolution of 
popularity of a given content. The work is based on rational 
decision-making assumptions, whereby the users have to de- 
cide whether to see a given content or not. This configures as 
a game, where users seek to maximize some expected utility 
based on their "perception" of the quality of the contend and 
on viewcount. However, users suffer also a cost for accessing 
contents of bad quality, i.e., waste of time and possibly 
bandwidth, batteries, etc. In particular, in the decision process 
the viewcount is used as a noisy estimator of the quality 
of a content. Interestingly, this context resembles closely the 
situation in the economic domain, where customers of a firm 
which are uninformed do infer the quality of products from 
the length of the queue they encounter upon requesting firm's 
goods to purchase 0. 

Extensive advertising and marketing campaigns can be used 
to push the viewcount of a given content up. And in the deci- 
sion making process users do not know whether the viewcount 
has been "pushed" by such means. Also, the decisions made 
by different users influence the viewcount and consequently 
the decisions made by other users, a process which suits well 
the usage of game theoretical machinery. 

Specifically, we describe the conditions for the adoption of 

2 This may come, e.g., from the name of user who posted the content. 



common behaviors in online content access. This is inspired 
by findings in social science (9), iflOl . ifTTI : results there show 
that emerging behaviours would propagate by a procedure in 
which an individual adopts a novel behavior if the fraction 
of neighbors or friends having adopted the same behavior 
exceeds some threshold. In our context, the threshold would 
be expressed in terms of viewcount or related metric. 

In the sense of game theory, users of online media repre- 
sent non-cooperative rational players connected through some 
social tie, e.g., being users of the same UGC platform. Since 
we consider systems composed by a very large number of 
users, the customary tool to study the user behaviour is that 
of Wardrop equilibria lTT2l . In particular, we have found a 
number of conditions for which such equilibria exist and can 
be characterized analytically. Explicit conditions were found 
for content to stay at zero views or to become so popular that 
it is makes sense for all users to access it the sooner the better. 

Furthermore, we identify, for the general case, conditions 
under which players tend to accrue around a common strategy 
depending on initial conditions. This is due to the existence of 
a continuum of equilibria: the system will settle at any point 
very much depending on initial conditions imposed, for in- 
stance, by a set of forerunners which cause significant changes 
of the content popularity. Such conditions were identified in 
early works such as |[T3l in other contexts: there, the authors 
applied threshold type Nash equilibrium strategies in which 
one purchases priority if and only if upon arrival the queue size 
is larger than some threshold value. Key motivation in lfl3l is 
predictability and control of purchase priority. What motivates 
this work is predictability and control of online content access. 

Novel contribution: in this paper, we move away from the 
classical analysis of social networks in the spirit of Q, H, 
G), H): instead, we provide a first analysis based on games. 
The aim of this paper is to provide a novel perspective where 
contents compete to gain popularity and are subject to the 
effect of user's choice. To the best of the authors' knowledge, 
this is the first attempt so far to describe content popularity in 
UGC systems using game theoretical tools. 

The remainder of the work is organized as follows. In 
Sec. [n] we introduce the system model and the notation 
used throughout the paper. Results for the case when plain 
viewcount is used to make decisions are presented in Sec. Hn] 
When decisions account also for a large increasing trend of 
content popularity, i.e., looking for 'hot' content, the dynamics 
of the game becomes different. This case is analysed in 
Sec. |lVl In Sec. [V] we analyze the joint effect when both 
the viewcount and its trend are both relevant to the user. 
Finally, in Sec. |VT] we model the effect of side information 
when users have some measure of future content dynamics. 
Sec. I VIII reviews the related work and Sec. IVIIII concludes the 
paper highlighting directions for expanding the current reach 
of the work. 

II. System Model 

We consider contents made available to a user by means of 
YouTube or a similar platform. We denote by r the lifetime of 
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(b) "Chris Sharma Worlds' First 5.15" 
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(d) "Shakira - Waka-Waka" 
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(e) "Bruno Mars - Grenade" 




(f) "Adele - Rolling in the deep" 

Fig. 1: Dynamics of the viewcount for six sample videos: the push 
dynamics can be identified with the first part of the dynamics, where 
labels identify some actions that are significant for the diffusion of 
the video; observe for cases a, b and c how a linear dynamics takes 
over in the last part of the dynamics. The labels tagging the first part 
of the dynamics mention specific events that identify the diffusion of 
the content on specific platforms or channels. 



a content, i.e., the time horizon during which the content bears 
some interest. In general, such horizon differs depending on 
the type of content: it can be typically of the order of weeks 
to months for YouTube videos or a few days for news Q. A 
possible extension to the case of variable time horizon is the 
addressed in Sec. |IV] 

We denote by X(t) the viewcount attained by a given 
content 6 at time t seconds after it has been posted, for 
< t < T. 

As in standard UGC platforms, there are two mechanisms 
that coexist and can jointly increase the viewcount: 

• push: the content provider exploits some preferential 
channels (including paid advertisement either directly on 
the UGC system or via social networking platforms) to 
make users aware of the content and to induce them to 
access it. We call push users the users that access the 
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content as a reaction to the push mechanism. 

> pull: users find about the content through standard search 
and decide to access it based on the belief that the content 
is relevant for them. We call users accessing a content 
through the pull mechanism pull users. 

In practice, many YouTube videos are subject to the push 
and the pull mechanisms described above such as the examples 
that we reported in Fig. Q] For instance, Fig. QJ, shows the 
dynamics of a popular video with viewcount X > 675000. 
The YouTube statistics associated with the video describe 
explicitly a series of events happening in the first part of the 
dynamics of X. For instance, the event B that appears around 
02/12/2012, is precisely the event , 'First embedded 
on: plus.google.com' ' which indeed configures as a 
push towards a social network platform. After the initial push, 
such events vanish, and the rest of the dynamics appears 
ascribed mostly to the pull mechanism defined above, with 
a linear increase in the viewcount. 

Also, some of the reported videos are representative of a 
specific class of online contents, which are those we will be 
dealing with in the rest of the paper. We can refer to those as 
the contents that comply to the exponential-linear model, for 
the sake of brevity. In particular, many such contents appear 
to obey to the following dynamics: after an initial exponential 
growth, the increase of the viewcount becomes linear. The way 
to interpret such a behavior can be traced to the notion of push 
and pull mechanisms described above: the exponential growth 
corresponds to actions through which the source distributes 
the content within a basin of target push viewers. When such 
basin is finite and small with respect to the content diffusion 
dynamics, the viewcount dynamics experiences a saturation 
effect which takes over after an initial phase. However, at 
that stage, the access to the content is due to pull users 
that come across the content browsing online: they do so at 
random from a very large basin, so that the access rate, i.e., 
the viewcount increase rate, is linear. These combined effects 
are visible in the case of the first two videos, i.e. Fig QJ and 
Fig QJ. In the case of the first video, the saturation effect is 
well visible, whereas in the case of the second one the linear 
increase following the saturation is dominating. The example 
in Fig QJ is a case where all the dynamics is linear with 
good approximation: as it will be clear in the following, in the 
exponential-linear model this case is represented when either 
the basin of push users is large or when the rate at which 
contents are pushed is small. 

Remark 1: Not all videos will diffuse according to the 
proposed exponential-linear model. For instance, there exist 
cases when the initial viewcount dynamics displays a charac- 
teristic sigmoid shape. We reported in Fig[T]l,e,f the viewcount 
dynamics for three popular music videos: in those cases the 
dynamics resembles the logistic curve associated to the spread 
of epidemics. We can ascribe such similarity to the presence 
of a positive feedback in the push mechanism, e.g., those who 
access the content have some mean to recommend the content 
for others to access it, through targeted recommendation or 
similar mechanism. When a social network is present, this 
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Fig. 2: The reward or the cost of content 9 for a tagged user is 
represented by the time during which the content can be accessed, 
i.e., when viewcount is larger than threshold j3. 



may happen due to the push of the content into the neighbor- 
hood of those who view the content. A similar and perhaps 
more powerful feedback effect can happen between different 
channels on the same platform, e.g., YouTube channels, and 
across different platforms through the recommendation list that 
is presented to the platform users. 

This also qualifies the type of exponential-linear dynamics that 
we consider as those for which this type of feedback does not 
play a significant role. In particular, in the case of Fig QJ, 
the content is of interest at the national scale in the US, and 
the viewers are likely driven to the content by general search 
criteria (e.g., typing in a search engine). Also, in the case 
of Fig [Tt, the viewers are likely those who browse for some 
specific recipe, whereas in the case of Fig [TJi viewers are 
interested in a niche sport, where the event is known within 
the reference community. In all such cases we see that the 
linear part of the dynamics takes over and becomes dominant. 

Game model 

In our model, we are interested in the uptake of the pull 
users. Pull users interested in the given content do not know in 
advance its quality. They may discover it during interval [0, r) 
at random. Their estimation of the interest/potential quality is 
based on the viewcount X. In the simplest case, contents with 
higher viewcount are more likely to be accessed. 

We define by X ps (t) the number of push users accessing 
the content up to time f as a reaction to the push mechanism 
and, analogously, by X pu (t) the number of those accessing it 
through the pull mechanism. Clearly, X(t) = X ps (t)+X pu (t). 

Users have beliefs about the quality of the content. We 
denote by -kq the belief that a given content is good (i.e., 
of interest or anyway worth accessing) and, conversely, by 
ix b = 1 — ttq the belief that the content is bad. We denote 
by 7r = (ttg^b) the corresponding distribution. Stating 
txq = 0.75 means that a user believes that every 4 similar 
contents she would get 3 good ones and 1 bad one. 

The content access configures as a game where we define 
players, strategies and utilities. Players: the players are pull 
users: based on their belief 7r, they may access the content 9 
or not. 

Strategies: they access 9 when the viewcount is above a 
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certain threshold, i.e., X{t) > 13 > 00 Hence, the strategy for 
a certain user is the viewcount threshold j3 > 0. Of course, 
all other players also adopt their own strategy with respect to 
9 and we denote a. the vector of strategies of all remaining 
users: a is a vector of viewcount thresholds for all other users. 

Utilities: users face either a cost C or a reward R for playing 
strategy f3: the cost and the reward is the fraction of lifetime 
when the content is in the viewcount range, i.e, when they are 
willing to access it. The rationale to define this cost/reward is 
the following. Let a good content be worth one unit reward, 
and a bad content worth a unit cost. The user may hit several 
similar contents at random over time. If they are good, the 
fraction of those actually accessed will be proportional to 1 — 
— , where we define tp = min{t | /3 = X(t)}, i.e., tp is the 
smallest instant when the threshold is achieved. That also is 
going to be the long term reward, or the cost, for accessing 
similar online contents. Formally, 

R(a, 0, G) = (t- tp(G)) + , C(a, (3, B) = (r - t fj {B))+ 

Finally, based on their belief 7r, players expect a utility when 
playing (3 that amounts to 

U(a, /3) = n G R(a, /3, G) - 7r B C(a, f3, B) 

According to the above expression, the cost and the reward 
are a function of the interval when the content is above the 
threshold, i.e., when the users can benefit from it, and depends 
on the other players strategy. Furthermore, the action taken by 
players depends on their belief on the quality of the content. 

In the following we will investigate symmetric equilibria, 
i.e., equilibria for which all users play a > 0. We can hence 
adopt a simplified scalar notation and define t a = mm{t\a = 
X(t)}. 

Let a tagged user playing f3 when all the remaining users 
use a: we make the assumption that Wardrop conditions holds. 
Namely, for a large number of users any unilateral deviation 
of a single user does not affect the utilities of other users. 
I.e., deviations due to a single user action are negligible. 
Wardrop equilibria are much easier to compute than the Nash 
equilibrium; however, Wardrop is a good approximation for 
the latter, as in [1410 

The tagged user expects to gain a certain reward R(a, /3, G), 
for a good content and expects to suffer a cost C(a,f3,B) 
when the content is bad: under which conditions a is the 
best response to itself, namely (3*(a)l We answer to this 
question in the next sections under different knowledge of the 
viewcount dynamics available to users. 

Before we introduce our analysis, we recall that the utility 
function has the following expression for (3 > (3 t ,b 

if p > p r>G 

7T G (T-tp(G)) if P T ,B<0<P T ,G 



U(a,(3) = 



3 We consider the reference case when players select based on the viewcount 
only for the sake of explanation. We will extend the model to other interesting 
cases in next sections. 

4 A traditional application of Wardrop equilibria is road traffic, where users 
tend to settle to routes minimizing their delay: the effect of a route change 
of an individual driver belonging to a flow is negligible system-wide to the 
utilities of other users. 



where f3 T ,e is solution of the following equation 



(1) 



We observe that the utility function U is nonincreasing for 
P > Pt.b- However the best response j3* (a) can be found only 
in the interval [0,/3 Ti b]. As a result we restrict our analysis 
to case when f3 < j3 Tt B in which the utility function can be 
expressed as 

U(a, (3) — ttq (t tp{G)) -7T B (r- tp(B)) 

III. Plain Viewcount 

The basic model that we introduce in this section is based 
on the assumption that pull users rely on the number of hits 
of the contents to judge if it is worth to access it or not, i.e., 
they judge based on how many users accessed it. Thus, they 
play based on the dynamics. We hence specialize our analysis 
to two cases. 

A. Linear case 

First, we examine the case when the process of diffusion 
of contents is linear. This is the case when the time scale of 
the content diffusion is very large compared to the pool of 
potential users. A mechanism that that is able generate such 
a dynamics is the combined effect of an advertisement which 
is broadcasted to a very large pool of viewers, e.g., covering 
newspapers or other general audience media, and people so 
made aware of the existence of the content who decide to 
access the content with some random delay thereafter. 

Thus, we let X ps (t, 9) = X ps t-l(t) where l(t) is the unitary 
step function, and X pu (t, 9) = X pu {t — t a ) ■ l(t — t a jf]. 

Observe that in this case X ps — X ps (9), whereas X pu is 
independent of 9. In fact, we assume pull users judge based 
on viewcount only 0. However, we assume that X ps (G) > 
X ps {B). 

Lemma 1: In the linear case, under the assumption 
X pu {G) > X pu (B), it holds 

i. if ^frrr > 



j-^G) - a^(b)' then F(a)=0. 
ii. if < ^fW but 



X p3 (G) - Aps(B) 

(3* (a) = a 



Ap s {G)-\-X pu 



iii. if 



< 



but 



> 



A pa (G) + A pu < A ps (B) + A 



A P s (B)-t-A ptl 



, then 



then 



Ap„(G) - Ap a (B) 

Proof: We need to distinguish two cases, namely a > (3 
and a < (3, determine the best response for each case, and 
then by comparison choose the best response (3* = (3* (a). 
The expression for the utility in the two cases follows. 

If a > /3, then X(t,6) = X ps {t,9) for < t < tp. Thus, 
we can write simply 

a /3 

t n — rrr, tp 



X ps {9) 

and the expression for the utility 

U(a,f3)=T(n G -KB)-!3( 



X ps (9) 



kb 



X ps (G) X ps (B) 



(2) 



5 In a single source diffusion model, for instance, X = N(l— exp(— Xt)) 
NXt + o(t) 
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If a < j3, then X(t,9) = X ps (t,9) for < t < t a and 
X(t, 9) = X ps (t, 9) + X pu (t, 9) for t a < t < tp. In this case, 

t a t /3 ~ a 

X pu (9) Xp S (9) + X pu 

and in turn 

U(a,f3) = t(tt g - -k b ) - a( 



so that 



7T G 



"B 



-C8-«)( 



Ap S (G) Xps(B) 



X ps (9) + X pu \p S (9) + A 



(3) 



Now, we can distinguish the three statements in the claim: 



a ^(G) — a ^75) : * n tne ^ rst case ' ^ ue to linearity, /3 = 
maximizes the utility; in the second case, we observe that 
indeed it must hold hq > ttb, and then 



TT G \p S (B) - lT B \ps{G) > > Xpui^B - TTg) 



so that -r — .'I?, , — > -r — , pf, . — : in turn the utility function 
is maximized again if p = 0. Hence, it holds (3* (a) = 0. 

ii. In the first case, it is optimal to maximize (3, which brings 
(3 = a. In the second case, in turn it is optimal to minimize 
/3, so that again j3 = a. Hence, f3* (a) = a. 

iii. In the first case, the best response is the same as in ii. In 
the second case, instead, it is optimal to maximize /3, so that 
again (3 = /3 t .b- However, the last term of © is positive and 
(3 = f3 T ,B maximizes it. Also, by comparison with (0, indeed 
/3*(a) = /3 t .b in this case. ■ 

The above results provide a characterization of the possible 
symmetric Wardrop equilibria of the system. 

Theorem 1: i. if A 1i jq\ > A ^jm 1 tnen i s a symmetric 

Wardrop equilibrium 
ii if < ZLp hut 



> 



< (3 < (3 t b are symmetric Wardrop equilibria 



,then 



iii. if 



< 



but 



< 



A ps (G) - X pB (B) \ps(G) + \ pu ^ A ps (B) + A pl , 

/3 t _b is a symmetric Wardrop equilibrium 
It is possible to interpret the above result as follows: 



then 



A ps (G) 

represents the time pace at which push users are believed to 



access a good content. Similarly 



represents the time 



A ps (B) 

pace at which push users are believed to access a bad content. 
Thus, condition i. suggests that it is always convenient to 
anticipate the access to the content. In case ii., the situation is 
dictated by the uptake of pull users, because they increase the 
viewcount thus reinforcing the believed viewcount pace of a 
good content against that of a bad content. Finally, in case iii. 
there is no incentive in accessing the content. 

B. Exponential case: fixed time horizon 

Let us consider the content dissemination process operated 
by a content provider using a finite set of potential target 
users. After the content is posted by the provider directly to 
users, it will be transmitted to more and more users by using 
some preferential channels. In this case, we need to model the 
push dynamics accounting for the size N of the pool of push 
users, i.e., we assume that the content provider disseminates 
the content according to 

X ps (t,9) = \ ps (9)(N - X ps (t,9)), 



X ps (t,9) = N{l-e 



-A ps (0)t\ 



for t > 



(4) 



We reported in Fig. [3] the shape of the utility function under 
the exponential case for a fixed time horizon. As it can be 
observed in case a), for smaller values of a, i.e, a = 400 a 
low value of the belief ttq causes the access to be delayed till 
time r, whereas for increasing values of ttq we observe first 
a local maximum at a {ir g = 0.75), and finally the strategy 
(3 = takes over corresponding to very large values of ttq- 
Indeed, such a behavior of the utility function resembles - for 
a fixed N - what we observed in the linear case. However, at a 
closer look, namely in Fig. [3};) we understand that the situation 
is more elaborate: in particular, we know that number of push 
users N impacts the speed at which the viewcount increases. 
As such, a small N does not permit to pass the threshold a, 
whereas a very large one incentivizes early access: recall that 
/3max := f3 T ,B means access at time t = 0. In between, the 
presence of a maximum predicts, as in the linear case, the 
existence of best responses that lie in the interior of [0, /3 m ax]- 
This intuitive numerical insight is confirmed by the theoretical 
results that we detail in the following. 

We distinguish two cases, namely a < (3 and (3 < a. 

If /J < a, we have 



Hence the utility becomes 

U(a, (3) = (n G -7t b )t + log (l - A) ( 



k l0g (^^) 



Ap S (#) 



7!"G 



7TB 



X ps (G) Xps(B) 



Let (31(a) (resp. /^(a)) be the best response to a in [0,a] 
(resp. [a,f3 max ]) 

Lemma 2: In the exponential case, under the assumption 

Xp S (G) > X ps (B), it holds for /3 < a 

• If ft <fe{§ then a 

• If ff >fef|ythen/3 1 *(a) = 

. If 22- = then for every (3{ e [0,a] is optimal 

Proof: The proof is similar to the one developed in the 
linear case for (3 < a. ■ 
Now, we study the second case: a < f3. If t a < t < tp, 

X(t, 9) = N(l - cxp(~X ps (9)t) + \m(t - t a ) (5) 



for which we obtain 



tp =X ps (9)(w( 



X ps (9) N e— N(1 -^ 



X 



log 



/'(/ 



(1-*) 



(6) 



where W(-) is the Lambert function 0151 . We can obtain the 

^ ( ) ( 1 ^ 

derivative of the above expression by letting = — r— 
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(a) Case a = 400 (b) Case a = 700 (c) a = 700, increasing N 



Fig. 3: The utility function for N = 1000, for r = 10 days, X ps {G) = 10 _1 views/day, X pB (B) = X ps (G)/W. a) a = 400 views, b) a = 700 
views. Increasing values of the belief no determine different shapes for the utility function, c) Increasing values of N — 700, 1000, 50000 
for a = 700. All graphs for X pu = 1.5NX ps (G). 



and ((6) 



N 



1 



d_ 

dp tp ~~ \ ps (8)df3 
1 1 



tnc(w,0))-iog(e(/3)) 



\u i + w(C(W,0)) 

After some cumbersome algebra, we derive 

Lemma 3: In the exponential case, under the assumptions 
Xp S (G) > X ps {B) and X ps (G)N < X pu , for a < /3 it holds 

• If 7tg < 7t_b then ^(a) = /3 r B 

If l±gM$M > ^ for al i £ e [ Q ,^ B ] th en 



l+W(C(B)£(a,B)) 

/3|(a) = a 



If 



l+W(gG)H(l3 T ,G)) 



< ff- for all /3 e [a,/3 T (B)] then 



l+W(C(B)C(ft ail ,B)) 

otherwise /?2 (a) is the solution of the following equation 

l + W(gG)qf3* 2 (a),G)) = ttg 
l + W(C(B)€(/95(a),S)) ttb 

Proof: The derivative of the utility function f7 is 



t/'(a,/3) = -L( 



"B 



"B 



Ku V W(C(<3)£(/3, fl)) W(C(G)e(/3, G)) 

(7) 

Since £(/?, G) > £(/?, B) and C(G) > ((B) then it is easy to 
check under condition ttq < ttb that U'(a, (3) > 0. Hence the 
utility function attains a unique maximum at /3 t> b- 

In order to complete the proof, it is sufficient to show that 
the function U is either non-increasing, or there is some f3 
such that U is non-decreasing for (3 < (3 and non-increasing 
for 0>p. 

Assume that there exists a /3 such that U'(a, f3) < 0. From 
(0, it is sufficient to show that 

U'(a,f3)<0 for all /3 > f3 

We can show the above propriety by letting W((3) = 



1 + W(C(GK(P,G)) , ■ , 
l+W(C(B)t(0,B)) and 11 mmS OUt ttlat 

dW(/3) 1 



0/3 (l + W(((B)af3,BW 
K{B)W{C(B)i{(3, B))(l + W(((G)a(3, G))) 
V l + W(((B)ai3,B)) 

gG)W(C(G)W, G))(l + W(C(B)g/3, B))) \ 
l + W(aG)t(p,G)) ) 

To show dV ^^ < 0, we impose the inequality 

C(B)W(C(B)C(fi,B)) < C(g)^(C(g)^(/3,-B)) (g) 



(1 + ^(C(B)C(AB))) 2 " (1 + W(C(G0^G8,G))) S 

We can obtain the above inequality under assumption 
\ S (G)N < X pu by letting 



Hence the derivative of / can be expressed as 

df ,.y{y) + w{y){l - y(l -I)) +2 + y(l - &) 



dy- =Wm ~ 



(9) 

where y — y e "^_^' ■ In fact it can be showed that / is positive 
for y(l - $, ) < fie., X ps (G)N < \ pu . ■ 
Overall, the above cases are summarized in the following 
theorem 

Theorem 2: Let X ps (G) > X ps (B) and X ps (G)N < X pu , 
then in the exponential case 

i) If ttq < ttb then f3 T B is a symmetric Wardrop equilib- 
rium 

ii) If ttq > ttb then the following cases hold 



a) If 



< 



and 



l+W(((G)Z(a,G)) > ttg 



for all 



ttb ^ \ PS (B) W(C(B)i(a,B)) - 7T B 

(3 £ [a, Pt,b\ then all < f3 < Ty B are symmetric 
Wardrop equilibria 



6 



b) If 



7TB 



< 



and 



l+VK(C(G)g(/3, 
1+W(C(B)€(£- 



,G)) 

TBI) 



< 



7TB 



for 



O 



all /3 G [a, /3 Tj b] then /3 Tj b is a symmetric Wardrop 
equilibrium 

If — < "^"1 and there exists a 8 is the solution 

ttb 

of the following equation 

l + iy(C(G)C(/3,G)) _ ttq 

then f3 is a symmetric Wardrop equilibrium 



iii) If ^ 

' 7TB 



A (G) 

> \ F ■ then the following cases hold 



s : f l + tV(C(G);(a,G)) > ttq f o p r o i fhpn 

a) 11 i+w(c(B)aa,B)) - Tb~ tor a11 P 6 i a > Pt.bJ tnen 
is a symmetric Wardrop equilibrium 

b J 11 i+iy(c(s)€(a,-B)) - ¥b" tor a11 P G [a 5 Pr,Bj, tnen 
there exists a symmetric Wardrop equilibrium which 
is given by 

if tit b < ir G tp^ a (G) 

/3 T , fl if tttb >ir G t^ B {G) (10) 

8* G {0, /3 r , S } if T7TB = 7T G ^ B (G) 

Theorem. [2] displays a structure of the best response that 
is similar to the result obtained for the linear case, but we 
should highlight some differences. First, the additional request 
X ps (G)N < X pu is excluding the case when the effect of the 
pull mechanism is negligible compared to push mechanism. 
This means that we are restricting to the case when the 
aggregated maximum rate at which the viewcount can increase 
due to the push mechanism is smaller than the increase that 
is generated once the viewcount is above threshold for pull 
users. Indeed, this is the interesting case when the content 
provider's aim is to attract a large basin of pull users using a 
target limited audience of push users. 



Second, we observe that the term 



that was 



Ap S (#) + A pu 

present in the linear case is now replaced by a term involving 
the Lambert function W(-) 1151 : this is due to the combined 
effect of the exponential growth and the linear growth above 
the threshold, accounting for the saturation of the basin of 
push users. In the case when N is very large or X ps is very 
small, the term collapses to the condition expressed in the 
linear case. 

IV. Variable time horizon 

In this section, we are interested in the case where the time 
horizon during which the content is accessed by pull users is 
not fixed. But, it is determined by the popularity of the content 
and by the quality perceived by users. In particular, when the 
popularity of a content is subject to saturation, we can model a 
vanishing X to encode the condition when a content which is 
present online for a long time becomes stale. Conversely, fresh 
uptaking contents will experience large values of X and will 
be preferred. This case fits well specific types of contents such 
as news or pop songs, for which the trend of the viewcount 
increase may be the main trigger for the users' interest in some 
content. Pull users still adopt a threshold strategy and browse 
the content if 

X(t,6)> 7th (11) 



Let us consider the exponential push case introduced in the 
previous section. Condition ( fTTT i determines a variable horizon 
to access content 9: 

T{a,e) = x-\ lth ) 

Because the time horizon t = oo for jth < X pu , we restrict 
our analysis to the case when j t h > X pu . 

Again, we are interested to compute the utility function 
for a tagged user given a certain common threshold strategy 
a played by other users; the objective is to compute the 
best response 8 for the tagged user as done before. Let 
X th (6) = N - A, t (9) = —Ulogf^H') and 



Ap 3 (S)' '"V/ X ps («) -° V 7th 

Observe that the interval of time when pull users will access 
the content becomes now [tq(8) ti(8)]: the duration of such 
interval corresponds to the useful lifetime of the content as 
dictated by the interest of the users based on (fTTl i and by the 
content type. 

We distinguish again two intervals, namely < 8 < a and 
ct < 8 < t, and denote 8* and 8\ the best response in those 
intervals, respectively. However, we need to account also for 
( fTTb and to detail the utility accordingly. 

It follows that if 8 > a, then 

U(a, 8) = tt g (n(G) - tp{G)) + - tt b (tx(B) - t (B)) + 

If a > X th (G) and 8 < a 
U(a,B) = 7r G (ro(G)-^(G)) + +7r G (T 1 (G)-t ct (G))" 
-ttb (tq(B) — tp(B)J + — tt b (n(B)-t a (B) 
If X th {B) <a< X th (G) and 8 < a, then 
U{a,B) = 7r G ( n (G)-^(G)) 

-ttb (t {B) -t p (B)Y -ttb (n(S)-t„(B) N 
If X th (B) > a and 8 < a, then 

U(a,B) = 7r G (n(G)-^(G)) -7r fl (Ti(B)-t a (fl)) 

With a similar analysis as that employed in the proof of 
Thmf3] we can write: 

Theorem 3: In the exponential case, under the assumptions 
X ps (G) > X ps (B) and X ps (G)N < X pu , it holds 

• If tt G < ttb then 8 is a symmetric Wardrop equilibrium 
where 8 — 8 T (B) is solution of tp(B) = t 

. It tt g > ttb, — < I -^gy and i +H /(c(S)5( Q ,s)) ^ 7^ 
for all 8 G [a, /3 To B ] then all values in the interval [0, 8] 
are symmetric Wardrop equilibria. 



T f _ . _ ttg ^ Apa (G) , l + W(C(G)gQ3 T ,G)) < 7TG 

It tt g > ttb, — < and TTmmm - m < — 

for all 8 G [a, 8 TQ B (B)] then 8 is a symmetric Wardrop 
equilibrium. 
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Fig. 4: The utility function for N = 1000, for r = 10 days, X ps (G) = 10" 1 views/day, X ps (B) = A ps (G)/10. a) Detail of the discontinuities 
of U(a, j3) for 7 = 0.01, 0.1, 1, where a — 0.18 b) Extremal type of best response for a — 0.029, 7 = 1.5 and under increasing values of 
the belief ttg- c) Same as b) but for 7 = 0.3. Discontinuity in a corresponds to local maxima for tig = 0.25, 0.50. 



A IG) 

If > ttb, — < , ps ; R , and there exists B 3 solution 
of the following equation 

l + W(C(G)af3 S: G)) _ n G 
l + W({(B)Z(p s ,B)) ttb 



then j3 s is a symmetric Wardrop equilibria 



If m. > 



-y s (G) 



and and 



l + Wfc(G)g(a,G)) 



> m for all 



ttb ' X PS (B) l+W(((B)£(a,B)) - 7T B 

(3 6 [a, (3 Tg B ] then is a symmetric Wardrop equilibrium. 

11 > a^(S) and and i+w(c(B)£( a ,B)) S ^ tor all 
/3 G [a,/3 T0B ] then there exists a symmetric Wardrop 
equilibrium which is given by 





G {0,/3 TO (B)} 



if Tir B < TTGt^ B 

if TIT b > TT G t^ B 
if TIT b = nGti3 T o,B 



(12) 



The overall result in Thmj3] shows a structure that is close 
to that obtained in Thm. [2] We can conclude that the presence 
of a selective preference expressed in terms of the viewcount 
trend does not affect the structure of the Wardrop equilibria. 
In fact, they are of the kind determined before in the case of 
a fixed length interval: either extremal ones or a continuum of 
such restpoints. It is interesting to notice that this is following 
irrespective of the fact that the utility function is linear as a 
function of the "viewing time", i.e., the time that is useful for 
the viewers, but, pull users' preferences depend on a non-linear 
function of the threshold type. 

V. Combined effect of Trend and Viewcount 

In general, contents that are present online since a long 
time display different popularity than contents which last only 
a short time Q. As we noticed in the previous section, 
when popularity saturation occurs, X vanishes for large t. 
If users choose among contents with different trend and 
different viewcount, they would naturally choose a content 
with large viewcount and large increasing trend. To this respect 
y(t) = X(t)X(t) encodes the condition when the pull user 
still values the viewcount, but, she favors a large increasing 
trend given two contents with the same viewcount. 



Symmetric equilibria can be determined when in the system 
all users adopt a strategy 



y(t c 



0<t a <T 



and again we determine the best response for a user deviating 
using (3 := y{tp) as a reply, where < tp < t. 

It is easy to see that in the linear case, the model developed 
in the previous section applies as long as one replaces the 
dynamics with the one below 

X ps (t, 0) = Xp S (6)t + X ps (9), X pu = Xpuit — ta) -l(t — t a ) 

so that all the results can be specialized accordingly replacing 
X ps and X pu with Xp S and Xp U wherever they appear. The 
intuition is that when the regime of content diffusion is linear, 
i.e., when a large number of push users exists, the trend 
of popularity has the only effect to reinforce the inequality 
Xp S (B) X ps (G). We then move to a more interesting case. 

A. Exponential push case 

In the exponential case, the dynamics again is the same 
captured by @, Q. We can specialize the analysis to the two 
cases as done before. If a > j3, y{tp) = j3 implies that 

(3 = X ps {0)N 2 (l - e-W^Je-^W 1 * 

where the solution is such that tp — —j^f(/3,9) where we 



let /OS, 0) :=log i 1 



1 - 



4/3 



X P s(0)N 2 



After observing that /(0, B) = and f{(3, G) < f(0, B) < 0, 



again we obtain two extremal cases: when A ^jq) > A ^jm 
then U(a,j3) — (ttq — ttb)t < so that j3 = maximizes 



the utility. In the opposite case, namely, A e 9q\ < j — jb)' 
U(a, j3) ~ (ttg — itb)t > 0, so that f3 = a does. 
If a < f3, the condition for 

/3 = X(t)(x ps (9)Ne- x ^ + A pu ) 



gives: tp=t a (6) 



N 



£(£)e-2£(B) 



2000 



1000 



d. 



-1000 



-2000, 
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Fig. 5: The shape of function /3i(X pu ) for increasing values of X pu : 
the vertical asymptote corresponds to the value X pu . 



where we used the definition of We w = x and we stressed 
the dependence of t a on 9. It is important to notice that in this 
case, tp is not continuous, so that in correspondence of t a (G) 
and t a (B) the utility function has possibly two discontinuities. 
We reported in Fig. Et a ) the shape of the utility function for 
increasing values of 7 = 0.01,0.1,1 X pu — r yNX ps (G). For 
larger values of 7 the effect of discontinuities becomes negli- 
gible with respect to the shape of the utility function (indeed 
we are looking for the best response, i.e., the maximum of 
U(a,ft)). 

In particular, we observe in Fig. SJb) that for the choice of 
parameters there, i.e., 7 = 1.5, the shape of the utility function 
leads again to the customary extremal type of best response 
that we observed in the linear case. That is, access at time 
t = 0, i.e., ft = ft max for large itq and access at time tp = t, 
i.e., ft = for smaller values of -kq. However, for 7 = 0.3, 
see Fig. Etc), we find Wardrop equilibria {(3* (a) = a) in the 
interior of [0,/3 max ]. Further numerical exploration confirmed 
that the equilibria form an interval. Thus, again, we find that 
there exist conditions (in this case, smaller X pu ) when the 
system has a continuum of equilibria as in previous cases. 

VI. USERS WITH SIDE INFORMATION 

In the previous section we have considered the product of 
the trend and magnitude of the viewcount as a metric: as 
seen there, the structure of the equilibria that we can expect 
resembles closely what we found in the previous cases: either 
extremal Wardrop equilibria or a continuum of restpoints. We 
want to describe the case when potential viewers may be 
provided additional information on the upcoming popularity 
of a certain content, e.g., relying on some predictors or some 
apriori information they have. They judge whether to access 
or not a given content based on the product of the popularity 
X and the popularity trend X. But, they only know how such 
metric is going to accumulate over time, i.e., the metric for a 
user that approaches the content at time t is 



y(t) = J X{u)X{u)du 



1 



{X\r)-X\t)) 



This metric can be used as a simple benchmark case: it 
contains information on the future dynamics of X(ff), and it is 
defined by the current and the final values of the viewcount. 



However, the amount of such information in general is not 
sufficient at time t to state the type of the content. Of course, 
more sophisticated metrics are possible. Nevertheless, the one 
at hand will do for the purpose of showing that by making the 
potential viewers of a content aware of some side information, 
the system may experience a deep change in the structure of 
the equilibria. 

Let all users adopt strategy 

a := y(t a ), < t a < r 

and in the same way as done before we want to determine the 
best response for a user adopting ft := y(tp) as a reply, where 
< tp < T. 

In the case (3 > a, we recall that the dynamics is 

X(t,6) = a + X(6)(t-t a ) 
where X(9) := (X pu + X ps (9)) for the sake of notation, so that 

a + X(6)(tp - t a ) = ^X 2 (t, 9) -2(3 

which solves for tp = ^-(a^^r + y/X 2 (r, 9) -2ft). 
The corresponding expression for the utility is U (a, (3) = 



U (a,ft) 



ir G ^/X^T,G)-2f3 ^ Bv /(X2(r, J B)-2/3) 



KG) 

where the term Uq(oi, (3) = (ttg — itb)t — aX v 



X(B) 



P u \X ps (G)X(G) 



and it turns out that 



X ps (B)\(B) 

dU(a,/3) _ 

dp ~ A(G)(X 2 (r, G) - 2/3)^ ~ X(B)(X 2 (t, B)-2/3)i 

which is decreasing with (3 6 [— oo,/? Tj s], where /3 Tj b '■= 
\X{t, B) as follows by comparing the ratio of the two positive 
terms appearing in the expression above under the assumption 
X(t,G) > X(t,B)). When ^ ^ ^ the U(-,j3) over R 
attains a unique maximum at 



1 -X\t 1 G){^ ] ) 2 + X\t,B){ 



KG 

MG] 



so that there exists also one maximum of U(a,f3) in [t Q ,r]. 
We can distinguish three cases based on the fact that 

1) Pi < ol: the best response in this case is (3*{a) = a 

2) a < Pi < Pt.b- the best response is P*(a) = Pi 

3) Pi > Pt,b'- the best response in this case is ft* (a) = 

ftr,B- 

Finally, we notice that when = jjjpj, case 1) applies. 

In the case ft < a, we can derive a similar analysis starting 
from the dynamics X(t, 9) — X ps (9)t, so that 

P = v{tp) = 1(X 2 (T,6)-Xl(6)tf 



so that tp = ^/X 2 {t, 9) - 2/3, and 
U(a,ft) = (ir G - tt b )t 

\ G ^/X 2 (r,G)~2ft 7r B ^/(X 2 (T,B)-2ft) 



^ps(G) 



X P s(B) 
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In turn, we can recognize the same structure for the best 
response as in the previous case, where the maximum of 
U(-,/3) (when ^ x^fej)' over M is attained at 



ft 



( -JLQ_\ 2 _ I TO V 

\X n JG)) \X n JB)J 



.A SS (G), 



.X pa (G)J \X ps (B), 

and the three cases write 

1) ft < 0: the best response in this case is ft (a) = 0. 

2) < ft < a: the best response is ft (a) = ft. 

3) ft > a: the best response is ft (a) = a. 
Again, when j^fey = jZfe, case 1) applies. 

Now, to complete our analysis, we need to determine the 
best response between the two cases: we need to detail the 
relation between ft and ft. To so do we can rewrite for the 
sake of convenience 



1 tt 2 g x 2 X 2 (t, B) - tt|(L + x) 2 X 2 (t, G) 

2 ir 2 B x 2 - ir 2 G {L + x) 2 



where L = X ps (G) — X ps (B) and x = X ps (G) + X pu . It can 
be easily showed that 



-ft(*) 



a 2 2Lx(X 2 (r,G)-X 2 (r,i?))(x-L) 



drc"^ ' GB " (7r|x 2 ~7r G (L + x) 2 ) 2 
which brings -§^Pi{x) > for x > 0, with a singularity in 



pu 



KB 



■(X ps (G) — X ps (B)) — X ps (B) 



KG — ^B 

The typical shape of ft is reported in Fig. [5] We observe that 
ft(A pu = 0) = ft. The asymptotic value for X pu = oo is 

,_l 7rSX a (r,B)-7r|X a (r,G) 

It can be verified that f3i(X pu ) is injective. Hence, the above 
analysis let us state: ft (oo) < ft(0) = ,$2, which in turn leads 
to the following 

Lemma 4: For < X pu < X pu , it holds ft > ft, and for 
X pu > X s pu it holds ft < ft. 

Now we can combine the conditions above to derive: 

Theorem 4: Let / = [0,/3 T , B ] 

i. If A p „ > X pu , then 

= [9i,A,]nJ 

is the set of symmetric Wardrop equilibria for the system. 

ii. If X pu < X s pu then W s C {0,/3 T , B }. 

Proof: Case i. follows immediately observing that for 
j3i < a the best response is f3*{a) = a and for P2 > a 
the best response is f3*(a) — a: both conditions are satisfied 
simultaneously for a > if and only if a £ W s . 
Case ii. is proved observing that the conditions for case i fail, 
so that only extremal cases can hold. In particular, W s is not 
always the empty set: if /3 2 > 0, then /3i < so that /3*(0) = 
and the same holds in the opposite case, i.e., if /3i > a = j3 Tt B 
then Pi > a = j3 T . B so that /3*(j3 TiB ) = Pt,b- ■ 
The result in Thm. [4] let us observe a neat phase transition 
effect on X pu : when the intensity of the views due to the pull 



mechanism is below threshold X' pu , only extremal Wardrop 
equilibria are possible. Above that threshold, there can exist 
a continuum of equilibria where the system can settle. Let 
/i(-) denote the standard real measure: a sufficient condition 
is provided in the following 

Corollary 1: fi(W s ) > if X pu > X pu and /3 2 > > ft. 
We can observe that ttq < "Kb implies > and A pu > > 
X pu , so that a stronger sufficient condition than the one just 
provided in turn becomes: ttq < ttb and ft < ft, b- 

VII. Related Works 

The analysis of dynamics of popularity of online contents 
has been subject of recent papers. The work |]3] provides an 
analysis of the YouTube system, with comprehensive view of 
the characteristic of the generated traffic. 

In Q the authors address the relation between metrics 
used to evaluate popularity. They observed that viewcount is 
strongly correlated with several such metrics as number of 
comments, ratings, or favorites. However, all such metrics do 
not correlate to average rating. In this paper we confine our 
analysis to viewcount as the metric of interest. J7J focuses 
on the core problem of predicting popularity, namely, the 
viewcount, based on early measurements of user access. Based 
on YouTube videos or Digg stories measurements, the authors 
observe that contents increasing fast their viewcount in early 
stages typically become popular later on. The proposed em- 
pirical model, i.e., logiV(i r ) = \ogN(t ) + Xo(t r ,to) where 
Xo(t r ,to), is a random multiplicative noise and N(t r ), N(to) 
is the viewcount at t r and to; it resembles closely the expo- 
nential model adopted in this work. 

In O the authors propose a model accounting for change of 
ranking induced by UGC online platforms. The model is meant 
to overcome the limitations of the preferential attachment 
models. Those models in fact cannot explain bursty growth 
of content popularity; those in turn are claimed an inherent 
property of the online platforms. The authors relate bursty 
growth spikes to the way such systems expose popular contents 
to users and perform re-ranking of existing contents causing 
positive feedback loops. 

The paper (6} provides analysis of power law behavior 
for the rank distribution of contents; the distribution of most 
watched videos is found heavily skewed towards the most 
popular ones. 

Threshold models similar to those studied in this work 
are described by Granovetter fTTTI in social science. The 
assumption is that individuals make binary decisions (in our 
framework, view or not view a content), according to some 
static internal threshold of others participating. A generaliza- 
tion based on threshold distribution is addressed in J9]- 

VIII. Conclusions 

In this paper we characterized the access to online contents 
by game theoretical means by leveraging on the concept of 
Wardrop equilibrium. We deduced the structure of equilibria 
in systems where users adopt threshold type policies to select 
online contents. We explored several cases: the case when the 
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plain viewcount is the metric, or the viewcount trend, or both 
are combined as a product metric. We explored the case of 
a fixed time horizon dictated by the content lifetime, and we 
considered a case when the time horizon is not fixed. Finally 
we explored the impact of side information available to users. 

In all such cases we deduced the presence of a continuum of 
equilibria, which has potential implications in the design and 
control of platforms for online content access. In future work, 
in particular, we are exploring the dynamics associated to such 
sets of interior restpoints, when they exist, and comparing 
those with typical dynamics of online contents. However, 
not only equilibria are relevant: as showed in liTOl . threshold 
strategies, under specific conditions, may well lead the system 
to be asymptotically unstable; system trajectories may in turn 
consist of cycles that can move into a chaotic dynamics, 
essentially indistinguishable from random noise. 
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